INAOE at GeoCLEF 2008: A Ranking Approach based on Sample Documents

Authors

  • Esaú Villatoro-Tello
  • Manuel Montes-y-Gómez
  • Luis Villaseñor Pineda
Abstract

This paper describes the system developed by the Language Technologies Laboratory of INAOE for the Geographical Information Retrieval task of CLEF 2008. The system focuses on the problem of ranking documents according to their geographical relevance. It rests on two hypotheses: (i) current IR machines are able to retrieve relevant documents for geographic queries, but they cannot generate a pertinent ranking; and (ii) complete documents provide more and better elements for the ranking process than isolated query terms. Based on these hypotheses, our participation at GeoCLEF 2008 aimed to demonstrate that a few query-related sample texts can improve the final ranking of the retrieved documents. Experimental results indicated that our approach could improve the mean average precision (MAP) of some sets of retrieved documents using an average of only two sample texts. These results also showed that the proposed approach is very sensitive to the presence of irrelevant sample texts as well as to the ambiguity of geographical terms.
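The general idea of re-ranking with sample documents can be illustrated with a minimal sketch. This is not the authors' system, whose details are not given in the abstract: the bag-of-words representation, the cosine similarity measure, the averaging over samples, and the names `tokenize`, `cosine`, and `rerank` are all assumptions made for illustration, and the documents are toy data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rerank(retrieved, samples):
    """Reorder retrieved documents by mean similarity to the sample texts.

    `retrieved` is a list of (doc_id, text) pairs in the original IR order;
    `samples` is a list of sample texts (an assumed input format).
    Ties keep the original order because sorted() is stable.
    """
    sample_vecs = [Counter(tokenize(s)) for s in samples]
    if not sample_vecs:
        # Without sample texts there is nothing to compare against.
        return [doc_id for doc_id, _ in retrieved]

    def score(text):
        vec = Counter(tokenize(text))
        return sum(cosine(vec, sv) for sv in sample_vecs) / len(sample_vecs)

    scored = [(score(text), doc_id) for doc_id, text in retrieved]
    return [doc_id for _, doc_id in sorted(scored, key=lambda p: -p[0])]


# Toy data: three retrieved documents and one query-related sample text.
retrieved = [
    ("d1", "stock markets fell in Asia"),
    ("d2", "floods hit villages along the Rhine in Germany"),
    ("d3", "Rhine river cruise prices"),
]
samples = ["heavy floods along the Rhine damaged German towns"]

ranking = rerank(retrieved, samples)
print(ranking)
```

In this toy run the document that shares the most vocabulary with the sample text moves to the top. The sketch also makes the sensitivity noted in the abstract easy to see: an irrelevant sample text would pull unrelated documents up by the same mechanism.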

Similar resources

The XLDB Group at GeoCLEF 2005

This paper describes our participation at GeoCLEF 2005. We detail the main software components of our Geo-IR system, its adaptation for GeoCLEF and the obtained results. The software architecture includes a geographic knowledge base, a text mining tool for geo-referencing documents, and a geo-ranking component. Results show that geo-ranking is heavily dependent on the information in the knowledg...

Re-Ranking for Geo-Relevance With Non-Contextual Heuristics at GeoCLEF 2007

Geographic Information Retrieval (GIR) is an attempt to improve relevance by taking geographic information in textual documents into account. We describe our experiments carried out at the GeoCLEF 2007 evaluation [1] that investigate further the role of geo-filtering based re-ranking and query expansion with geographic terms. Our main findings are that manual query expansion with geo-terms is m...

The University of Lisbon at GeoCLEF 2006

This paper details the participation of the XLDB group from the University of Lisbon at the GeoCLEF task of CLEF 2006. We tested text mining methods that make use of an ontology to extract geographic references from text, assigning documents to encompassing geographic scopes. These scopes are used in document retrieval through a ranking function that combines BM25 text weighting with a similari...

GeoCLEF 2008: the CLEF 2008 Cross-Language Geographic Information Retrieval Track Overview

GeoCLEF is an evaluation initiative for testing queries with a geographic specification in a large set of text documents. GeoCLEF ran a regular track for the third time within the Cross Language Evaluation Forum (CLEF) 2008. The purpose of GeoCLEF is to test and evaluate cross-language geographic information retrieval (GIR). GeoCLEF 2008 consisted of two subtasks. A search task ran for the third...

The University of Lisbon at GeoCLEF 2008

This paper reports the participation of the XLDB team from the University of Lisbon at the 2008 GeoCLEF task. We focused on developing a better text annotation tool for geo-parsing the documents, handling both explicit geographic evidence (as given by placenames) and implicit geographic evidence (as given by monuments, for example). The query processing and geographic ranking approaches were re...

Publication date: 2008